Chemically Cleavable 3&#39;-O-Allyl-DNTP-Allyl-Fluorophore Fluorescent Nucleotide Analogues and Related Methods

ABSTRACT

This invention provides a nucleotide analogue comprising (i) a base selected from the group consisting of adenine, guanine, cytosine, thymine and uracil, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′-oxygen of the deoxyribose and (iv) a fluorophore bound to the base via an allyl linker, and methods of nucleic acid sequencing employing the nucleotide analogue.

The invention disclosed herein was made with Government support under Center of Excellence in Genomic Science grant number IP50 HG002806-01 from the National Institutes of Health, U.S. Department of Health and Human Services. Accordingly, the U.S. Government has certain rights in this invention.

Throughout this application, various publications are referenced in parentheses by number. Full citations for these references may be found at the end of each experimental section. The disclosures of these publications in their entireties are hereby incorporated by reference into this application to more fully describe the state of the art to which this invention pertains.

BACKGROUND OF THE INVENTION

With the completion of human genome project, there is now a focus on developing new DNA sequencing technology that will reduce the cost of sequencing dramatically without sacrificing accuracy, which will ultimately enable personalized medicine in healthcare (1). Current state-of-the-art DNA sequencing technology faces limitation in terms of cost, read-length, and throughput. In this regard, DNA sequencing by synthesis (SBS), where the identity of each nucleotide is detected immediately after its incorporation into a growing strand of DNA in a polymerase reaction, offers an alternative approach to address some of these limitations. An important requirement for the SBS approach is a 3′-OH capped fluorescent nucleotide that can act as a reversible terminator (2), where after the identification of the nucleotide incorporated in a DNA polymerase reaction, the 3′-OH capping group along with fluorescent label are removed to regenerate a free 3′-OH group thus allowing DNA chain elongation. The importance of removing the fluorescent label after each base identification is to make sure that the residual fluorescence from the previous nucleotide incorporation does not affect the identification of the next incorporated fluorescent nucleotide.

The speed and sequence read length of SBS depend on the yield of the cleavage efficiency of the fluorophore and the allyl group. Due to multiple steps required in the identification, removal of fluorescent label, and regeneration of 3′-OH group after each nucleotide incorporation in SBS, the loss of even a minor efficiency at each step may lead to inhibition of prolonged read length. For this reason, any improvement in efficiency within each cycle of nucleotide identification, fluorophore removal, and 3′-OH regeneration can have significant impact on read length, thus tackling the physical limits in DNA sequencing by synthesis.

SUMMARY

This invention provides a nucleotide analogue comprising (i) a base selected from the group consisting of adenine or an analogue of adenine, guanine or an analogue of guanine, cytosine or an analogue of cytosine, thymine or an analogue of thymine and uracil or an analogue of uracil, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′-oxygen of the deoxyribose and (iv) a fluorophore bound to the base via an allyl linker.

This invention also provides a method for making a nucleotide analogue wherein the nucleotide analogue comprises (i) a base selected from the group consisting of adenine or an analogue of adenine, guanine or an analogue of guanine, cytosine or an analogue of cytosine and uracil or an analogue of uracil, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′ oxygen of the deoxyribose, and (iv) a fluorophore bound to the base via an allyl linker which is not an iso-allyl linker, comprising the steps of:

-   -   (a) contacting 6-amino-hex-2-en-1-ol and an N-hydroxysuccinimide         ester of a fluorophore in the presence of a first suitable         solvent and a suitable base;     -   (b) treating the resulting product of step (a) with DSC/Et₃N in         a second suitable solvent; and     -   (c) treating the resulting product of step (b) with a         3′-O-allyl-dNTP-NH₂ in the presence of a suitable buffered         solvent, wherein the base of the 3′-O-allyl-dNTP-NH₂ is an         adenine, guanine, cytosine, uracil, or an analogue thereof,         thereby making the nucleotide analogue.

This invention also provides a method for making a nucleotide analogue wherein the nucleotide analogue comprises (i) a base selected from the group consisting of adenine or an analogue of adenine, guanine or an analogue of guanine, cytosine or an analogue of cytosine and uracil or an analogue of uracil, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′ oxygen of the deoxyribose, and (iv) a fluorophore bound to the base via an allyl linker, comprising the steps of:

-   -   (a) contacting 2-(2-amino-ethyl)-prop-2-en-1-ol and an         N-hydroxysuccinimide ester of a fluorophore in the presence of a         first suitable solvent and a suitable base;     -   (b) treating the resulting product of step (a) with DSC/Et₃N in         a second suitable solvent; and     -   (c) treating the resulting product of step (b) with a         3′-O-allyl-dNTP-NH₂ in the presence of a suitable buffered         solvent, wherein the base of the 3′-O-allyl-dNTP-NH₂ is an         adenine, guanine, cytosine, uracil, or an analogue thereof,         thereby making the nucleotide analogue.

This invention also provides a method for determining the sequence of a DNA comprising performing the following steps for each residue of the DNA to be sequenced:

-   -   (a) contacting the DNA with a DNA polymerase in the presence         of (i) a primer and (ii) four fluorescent nucleotide analogues         under conditions permitting the DNA polymerase to catalyze DNA         synthesis, wherein (1) the nucleotide analogues consist of an         analogue of dGTP, an analogue of dCTP, an analogue of dTTP or         dUTP, and an analogue of dATP, (2) each nucleotide analogue         comprises (i) a base selected from the group consisting of         adenine, guanine, cytosine, thymine or uracil, and analogues         thereof, (ii) a deoxyribose, (iii) an allyl moiety bound to the         3′-oxygen of the deoxyribose and (iv) a fluorophore bound to the         base via an allyl linker, so that a nucleotide analogue         complementary to the residue being sequenced is bound to the DNA         by the DNA polymerase, and (3) each of the four analogues has a         predetermined fluorescence wavelength which is different than         the fluorescence wavelengths of the other three analogues;     -   (b) removing unbound nucleotide analogues;     -   (c) determining the identity of the bound nucleotide analogue;         and     -   (d) following step (c), except with respect to the final DNA         residue to be sequenced, chemically cleaving from the bound         nucleotide analogue the fluorophore and the allyl moiety bound         to the 3′-oxygen atom of the deoxyribose,     -   thereby determining the sequence of the DNA.

This invention also provides a kit for performing the instant method comprising, in separate compartments,

-   -   (a) a nucleotide analogue of (i) GTP, (ii) ATP, (iii) CTP         and (iv) TTP or UTP, wherein each analogue comprises (i) a base         selected from the group consisting of adenine, guanine,         cytosine, thymine or uracil, or an analogue thereof, (ii) a         deoxyribose, (iii) an allyl moiety bound to the 3′-oxygen of the         deoxyribose and (iv) a fluorophore bound to the base via an         allyl linker,     -   (b) reagents suitable for use in DNA polymerization; and     -   (c) instructions for use.

This invention also provides a method for covalently affixing a detectable moiety, via an allyl linker, to an NH₂-bearing molecule, comprising contacting the detectable moiety with the NH₂-bearing molecule in the presence of a suitable solvent and suitable base, wherein the detectable moiety comprises a mass tag, fluorophore or chromophore bound to a NHS ester of an allyl moiety. In one embodiment of the NH₂-bearing molecule is a nucleotide and the detectable moiety comprises a fluorophore.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1: Structures of four 3′-O-allyl-dNTP-PC-fluorophores.

FIG. 2. Synthesis of 3′-O-allyl-dNTP-allyl-fluorophores.

FIG. 3. Polymerase DNA extension reaction using 3′-O-allyl-dGTP-allyl-Bodipy-FL-510 as a reversible terminator.

FIG. 4. A polymerase extension scheme (left) and MALDI-TOF MS spectra of extension and dual-deallylation product (right).

FIG. 5. Structures of four alternative chemically cleavable fluorescent nucleotides, 3′-O-allyl-dNTP-iso-allyl-fluorophore.

FIG. 6. Synthesis of 3′-O-allyl-dNTP-iso-allyl-fluorophore.

FIG. 7. A polymerase extension scheme (left) and MALDI-TOF MS spectra of extension and dual-deallylation product (right).

DETAILED DESCRIPTION OF THE INVENTION Terms

The following definitions are presented as an aid in understanding this invention:

A—Adenine; C—Cytosine;

DNA—Deoxyribonucleic acid;

DMF—Dimethylformamide; G—Guanine; NHS—N-hydroxysuccinimidyl;

RNA—Ribonucleic acid; SBS—Sequencing by synthesis;

T—Thymine; and U—Uracil.

“Nucleic acid” shall mean any nucleic acid, including, without limitation, DNA, RNA and hybrids thereof. The nucleic acid bases that form nucleic acid molecules can be the bases A, C, G, T and U, as well as derivatives thereof. Derivatives of these bases are well known in the art, and are exemplified in PCR Systems, Reagents and Consumables (Perkin Elmer Catalogue 1996 1997, Roche Molecular Systems, Inc., Branchburg, N.J., USA).

As used herein, “self-priming moiety” shall mean a nucleic acid moiety covalently bound to a nucleic acid to be transcribed, wherein the bound nucleic acid moiety, through its proximity with the transcription initiation site of the nucleic acid to be transcribed, permits transcription of the nucleic acid under nucleic acid polymerization-permitting conditions (e.g. the presence of a suitable polymerase, nucleotides and other reagents). That is, the self-priming moiety permits the same result (i.e. transcription) as does a non-bound primer. In one embodiment, the self-priming moiety is a single stranded nucleic acid having a hairpin structure.

“Hybridize” shall mean the annealing of one single-stranded nucleic acid to another nucleic acid based on sequence complementarity. The propensity for hybridization between nucleic acids depends on the temperature and ionic strength of their milieu, the length of the nucleic acids and the degree of complementarity. The effect of these parameters on hybridization is well known in the art (see Sambrook J, Fritsch E F, Maniatis T. 1989. Molecular cloning: a laboratory manual. Cold Spring Harbor Laboratory Press, New York.)

As used herein, “nucleotide analogue” shall mean an analogue of A, G, C, T or U (that is, an analogue of a nucleotide comprising the base A, G, C, T or U) which is recognized by DNA or RNA polymerase (whichever is applicable) and incorporated into a strand of DNA or RNA (whichever is appropriate). Examples of nucleotide analogues include, without limitation 7-deaza-adenine, 7-deaza-guanine, the analogues of deoxynucleotides shown in FIG. 6, analogues in which a label is attached through a cleavable linker to the 5-position of cytosine or thymine or to the 7-position of deaza-adenine or deaza-guanine, analogues in which a small chemical moiety such as —CH₂OCH₃ or —CH₂CH═CH₂ is used to cap the —OH group at the 3′-position of deoxyribose, and analogues of related dideoxynucleotides. Nucleotide analogues, including dideoxynucleotide analogues, and DNA polymerase-based DNA sequencing are also described in U.S. Pat. No. 6,664,079.

1,3 dipolar azide-alkyne cycloaddition chemistry is described in WO 2005/084367 and PCT/US03/39354, the contents of each of which are hereby incorporated by reference.

All embodiments of U.S. Pat. No. 6,664,079 (the contents of which are hereby incorporated by reference) with regard to sequencing a nucleic acid are specifically envisioned here.

With regard to the synthesis of the nucleotide analogues disclosed herein, other fluorophores or chromophores to be cleavably attached to the base of the analogue are envisioned. In addition, combinatorial fluorescence energy tags as described in U.S. Pat. No. 6,627,748 (the contents of which are hereby incorporated by reference) or mass tags may be used in place of the fluorophores described herein.

EMBODIMENTS OF THE INVENTION

This invention provides a nucleotide analogue comprising (i) a base selected from the group consisting of adenine or an analogue of adenine, guanine or an analogue of guanine, cytosine or an analogue of cytosine, thymine or an analogue of thymine and uracil or an analogue of uracil, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′-oxygen of the deoxyribose and (iv) a fluorophore bound to the base via an allyl linker.

In one embodiment, the nucleotide analogue is an analogue of dATP, dGTP, dCTP or dUTP. In one embodiment, the fluorophore is selected from the group consisting of ROX, Bodipy-FL-510, Bodipy-650 and R6G. In one embodiment, the fluorophore is bound to the base via an iso-allyl linker.

In a further embodiment, the nucleotide analogue is selected from the group consisting of 3′-O-allyl-dGTP-iso-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-iso-allyl-Bodipy-650, 3′-O-allyl-dATP-iso-allyl-ROX and 3′-O-allyl-dUTP-iso-allyl-R6G, or is selected from the group consisting of 3′-O-allyl-dGTP-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-allyl-Bodipy-650, 3′-O-allyl-dATP-allyl-ROX and 3′-O-allyl-dUTP-allyl-R6G.

This invention also provides a method for making a nucleotide analogue wherein the nucleotide analogue comprises (i) a base selected from the group consisting of adenine or an analogue of adenine, guanine or an analogue of guanine, cytosine or an analogue of cytosine and uracil or an analogue of uracil, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′ oxygen of the deoxyribose, and (iv) a fluorophore bound to the base via an allyl linker which is not an iso-allyl linker, comprising the steps of:

-   -   (a) contacting 6-amino-hex-2-en-1-ol and an N-hydroxysuccinimide         ester of a fluorophore in the presence of a first suitable         solvent and a suitable base;     -   (b) treating the resulting product of step (a) with DSC/Et₃N in         a second suitable solvent; and     -   (c) treating the resulting product of step (b) with a         3′-O-allyl-dNTP-NH₂ in the presence of a suitable buffered         solvent, wherein the base of the 3′-O-allyl-dNTP-NH₂ is an         adenine, guanine, cytosine, uracil, or an analogue thereof,         thereby making the nucleotide analogue.

In one embodiment, the steps of the instant method comprise those set forth in FIG. 2, scheme A; FIG. 2, scheme B; FIG. 2, scheme C; or FIG. 2, scheme D.

In one embodiment of the instant method, the first suitable solvent is DMF and the second suitable solvent is DMF, and in another embodiment the first suitable solvent is acetonitrile and the second suitable solvent is DMF. In one embodiment of the instant method, the suitable base is NaHCO₃. In one embodiment of the instant method, the suitable buffered solvent is DMF buffered with NaHCO₃—Na₂CO₃. In another embodiment of the instant method, the suitable buffered solvent is acetonitrile buffered with NaHCO₃—Na₂CO₃.

This invention also provides a method for making a nucleotide analogue wherein the nucleotide analogue comprises (i) a base selected from the group consisting of adenine or an analogue of adenine, guanine or an analogue of guanine, cytosine or an analogue of cytosine and uracil or an analogue of uracil, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′ oxygen of the deoxyribose, and (iv) a fluorophore bound to the base via an allyl linker, comprising the steps of:

-   -   (a) contacting 2-(2-amino-ethyl)-prop-2-en-1-ol and an         N-hydroxysuccinimide ester of a fluorophore in the presence of a         first suitable solvent and a suitable base;     -   (b) treating the resulting product of step (a) with DSC/Et₃N in         a second suitable solvent; and     -   (c) treating the resulting product of step (b) with a         3′-O-allyl-dNTP-NH₂ in the presence of a suitable buffered         solvent, wherein the base of the 3′-O-allyl-dNTP-NH₂ is an         adenine, guanine, cytosine, uracil, or an analogue thereof,         thereby making the nucleotide analogue.

In one embodiment of the instant method, the steps comprise those set forth in FIG. 6, scheme A; FIG. 6, scheme B; FIG. 6, scheme C; or FIG. 6, scheme D.

In one embodiment of the instant method, the first suitable solvent is DMF and the second suitable solvent is DMF. In one embodiment of the instant method, the suitable base is NaHCO₃. In one embodiment of the instant method, the suitable buffered solvent is DMF buffered with NaHCO₃—Na₂CO₃.

This invention also provides a method for determining the sequence of a DNA comprising performing the following steps for each residue of the DNA to be sequenced:

-   -   (a) contacting the DNA with a DNA polymerase in the presence         of (i) a primer and (ii) four fluorescent nucleotide analogues         under conditions permitting the DNA polymerase to catalyze DNA         synthesis, wherein (1) the nucleotide analogues consist of an         analogue of dGTP, an analogue of dCTP, an analogue of dTTP or         dUTP, and an analogue of dATP, (2) each nucleotide analogue         comprises (i) a base selected from the group consisting of         adenine, guanine, cytosine, thymine or uracil, and analogues         thereof, (ii) a deoxyribose, (iii) an allyl moiety bound to the         3′-oxygen of the deoxyribose and (iv) a fluorophore bound to the         base via an allyl linker, so that a nucleotide analogue         complementary to the residue being sequenced is bound to the DNA         by the DNA polymerase, and (3) each of the four analogues has a         predetermined fluorescence wavelength which is different than         the fluorescence wavelengths of the other three analogues;     -   (b) removing unbound nucleotide analogues;     -   (c) determining the identity of the bound nucleotide analogue;         and     -   (d) following step (c), except with respect to the final DNA         residue to be sequenced, chemically cleaving from the bound         nucleotide analogue the fluorophore and the allyl moiety bound         to the 3′-oxygen atom of the deoxyribose, thereby determining         the sequence of the DNA.

In one embodiment of the instant method, chemically cleaving the fluorophore and the allyl moiety bound to the 3′-oxygen atom is performed using Na₂PdCl₄.

In one embodiment of the instant method, the primer is a self-priming moiety.

In one embodiment of the instant method, the DNA is bound to a solid substrate. In one embodiment of the instant method, the DNA is bound to the solid substrate via 1,3-dipolar azide-alkyne cycloaddition chemistry. In one embodiment of the instant method, about 1000 or fewer copies of the DNA are bound to the solid substrate.

In one embodiment of the instant method, the four fluorescent nucleotide analogues are 3′-O-allyl-dGTP-iso-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-iso-allyl-Bodipy-650, 3′-O-allyl-dATP-iso-allyl-ROX and 3′-O-allyl-dUTP-iso-allyl-R6G.

In another embodiment of the instant method, the four fluorescent nucleotide analogues are 3′-O-allyl-dGTP-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-allyl-Bodipy-650, 3′-O-allyl-dATP-allyl-ROX and 3′-O-allyl-dUTP-allyl-R6G.

In one embodiment of the instant method, the DNA polymerase is a 9°N polymerase.

This invention also provides a kit for performing the instant method comprising, in separate compartments,

-   -   (a) a nucleotide analogue of (i) GTP, (ii) ATP, (iii) CTP         and (iv) TTP or UTP, wherein each analogue comprises (i) a base         selected from the group consisting of adenine, guanine,         cytosine, thymine or uracil, or an analogue thereof, (ii) a         deoxyribose, (iii) an allyl moiety bound to the 3′-oxygen of the         deoxyribose and (iv) a fluorophore bound to the base via an         allyl linker,     -   (b) reagents suitable for use in DNA polymerization; and     -   (c) instructions for use.

In one embodiment, the kit comprises 3′-O-allyl-dGTP-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-allyl-Bodipy-650, 3′-O-allyl-dATP-allyl-ROX and 3′-O-allyl-dUTP-allyl-R6G.

In one embodiment, the kit comprises 3′-O-allyl-dGTP-iso-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-iso-allyl-Bodipy-650, 3′-O-allyl-dATP-iso-allyl-ROX and 3′-O-allyl-dUTP-iso-allyl-R6G.

This invention also provides a method for covalently affixing a detectable moiety, via an allyl linker, to an NH₂-bearing molecule, comprising contacting the detectable moiety with the NH₂-bearing molecule in the presence of a suitable solvent and suitable base, wherein the detectable moiety comprises a mass tag, fluorophore or chromophore bound to a NHS ester of an allyl moiety. In one embodiment, the NH₂-bearing molecule is a nucleotide and the detectable moiety comprises a fluorophore.

In an embodiment, the allyl is chemically cleaved using a palladium catalyst. In an embodiment the cleaving is performed using Na₂PdCl₄ and TPPTS. In one embodiment the pH is between 8.5 and 9. In a further embodiment the pH is 8.8.

In embodiments of this invention, the sequencing methods described can be applied, mutatis mutandis, to sequencing an RNA molecule or an RNA/DNA hybrid molecule.

This invention will be better understood by reference to the Experimental Details which follow, but those skilled in the art will readily appreciate that the specific experiments detailed are only illustrative of the invention as described more fully in the claims which follow thereafter.

Experimental Details Synopsis

Here, the construction of a novel chemically cleavable fluorescent labeling system based on an allyl group to modify nucleotides for DNA sequencing by synthesis (SBS) is explored. It is found that an allyl moiety can be used successfully as a linker to tether a fluorophore to a 3′-O-allyl-modified nucleotides (A, C, G, U), forming chemical cleavable reversible terminators, 3′-O-allyl-dNTP-allyl-fluorophore, for application in SBS. The fluorophore and the 3′-O-allyl group on a DNA extension product, which is generated by incorporating 3′-O-allyl-dNTP-allyl-fluorophore, are removed simultaneously in 30 sec by Pd-catalyzed deallylation in aqueous buffer solution. This one-pot dual-deallylation reaction thus allows the re-initiation of the polymerase reaction and increases the SBS efficiency. Expansion of this novel linker and selective protection strategy to other applications that include bio-conjugation, solution- and solid-phase organic synthesis is envisaged.

INTRODUCTION

A disulfide group has been previously explored as a chemically cleavable linker to attach a fluorophore to a deoxynucleotide and the use of 2-mercaptoethanol to remove the fluorophore after the nucleotide incorporation and detection in SBS (4). However, the disulfide bond can be reversed and destabilized under certain conditions (5, 6). Here, the construction of a novel chemically cleavable fluorescent labeling system based on an allyl group is disclosed. The discovery permits fluorophore linker cleaving the 3′-O-allyl capping group removal in a single step, thus increasing SBS efficiency. Disclosed here is an allyl moiety that can be used successfully as a linker to tether a fluorophore to a 3′-O-allyl-capped nucleotide, thus forming a set of chemical cleavable reversible terminators, 3′-O-allyl-dNTP-allyl-fluorophores (FIG. 2).

The fluorophore and the 3′-O-allyl group on a DNA extension product which is generated by incorporating structure 4 (FIG. 2) are removed simultaneously in 30 sec by Pd-catalyzed deallylation in aqueous buffer solution. This one-pot dual-deallylation reaction thus allows the re-initiation of the polymerase reaction. Design, synthesis and evaluation of a 3′-O-allyl fluorescent nucleotide, 3′-O-allyl-dGTP-allyl-Bodipy-FL-510 (structure 4) with a fluorophore attached to the 7 position of guanine base via an allyl carbamate linker, and its application as a reversible terminator for SBS is shown in FIG. 3.

Readily available allylic alcohol (structure 1) was chosen as a starting material for the preparation of 4. First, allylic alcohol 1 was reacted with N-hydroxysuccinimide (NHS) ester of the BODIPY-FL-510 to produce allylic-Bodipy-FL-510-NHS (structure 2), which was subsequently converted to its corresponding NHS ester (structure 3) by reacting with N,N′-disuccinimidyl carbonate. The coupling reaction between 3 and the modified nucleotide (3′-O-allyl-dGTP-NH₂) (3) produced the chemically cleavable fluorescent nucleotide, 3′—O— allyl-dGTP-allyl-Bodipy-FL-510 4.

To verify that 3′-O-allyl-dGTP-allyl-Bodipy-FL-510 4, acting as a reversible terminator, is incorporated accurately in a base-specific manner in a polymerase reaction, a polymerase DNA extension reaction was performed in solution as shown in FIG. 3. This allows the isolation of the DNA product at each step for detailed molecular structure characterization by using MALDI-TOF MS as shown in FIG. 4. First, a polymerase extension reaction using 3′-O-allyl-dGTP-allyl-Bodipy-FL-510 4 as a terminator along with a primer and the synthetic 100-mer DNA template corresponding to a portion of exon 7 of the human p53 gene was performed producing a single-base extension product (structure 5). After the reaction, a small portion of the DNA extension product 5 was characterized by MALDI-TOF MS. The rest of the extended DNA product 5 was added to a deallylation cocktail [1× Thermopol reaction buffer/Na₂PdCl₄/P(PhSO₃Na)₃] to perform dual-deallylation in a one-pot reaction for 30 sec to yield deallylated DNA product 6 and characterized by MALDI-TOF MS. The deallylated DNA product with both the fluorophore removed and a free 3′-OH group regenerated can then be used as a primer for a next nucleotide extension reaction.

FIG. 4 (right panel) shows sequential mass spectrum at each step of DNA sequencing by synthesis using 3′-O-allyl-dGTP-allyl-Bodipy-FL-510 4 as a reversible terminator. As can be seen from FIG. 4A, the MALDI-TOF MS spectrum consists of a distinct peak at m/z 6,967 corresponding to the single base DNA extension product 5 with 100% incorporation efficiency, confirming that the reversible terminator 4 can be incorporated base-specifically by DNA polymerase into a growing DNA strand. FIG. 4B shows the one-pot dual-deallylation result after 30 sec incubation of the DNA extension product in a deallylation cocktail solution. The peak at m/z 6,967 has completely disappeared, whereas the peak at m/z 6,512 corresponding to a DNA product 6 with both the fluorophore and 3′-O-allyl removed appears as the sole product. The absence of a peak at m/z 6,967 proves that the one-pot dual-deallylation reaction in removing both the fluorophore and the 3′-O-allyl group from the DNA product was completed with high efficiency. Furthermore, the allyl linker moiety is completely stable in a polymerase extension condition and can be selectively cleaved in a rapid and efficient manner.

Another version of the allyl linker for the construction of the chemically cleavable fluorescent nucleotides as reversible terminators was investigated. The structures of the four molecules (3′-O-allyl-dNTP-iso-allyl-fluorophore) are shown in FIG. 5. The syntheses of these molecules are shown in FIG. 6. As an example, the polymerase extension scheme using 3′-O-allyl-dGTP-iso-allyl-Bodipy-FL-510 as a reversible terminator and the corresponding MALDI-TOF MS results are shown in FIG. 7. The other three chemically fluorescent nucleotides (A, C, U) are similarly characterizable by MALDI-TOF MS.

The experimental results show that the 3′-O-allyl-dNTP-allyl-fluorophore can be faithfully incorporated into a growing DNA strand in a polymerase extension reaction to act as reversible terminators in SBS. This novel linker and selective protection strategy can be applied to other tasks that include bio-conjugation, solution- and solid-phase organic synthesis.

Material and Methods

General Information. ¹H NMR spectra were recorded on a Brucker DPX-400 (400 MHz) spectrometer and are reported in ppm from CD₃OD or CDCl₃ internal standard (3.31 or 7.26 ppm respectively). Data are reported as follows: (s=singlet, d=doublet, t=triplet, q=quartet, m=multiplet, coupling constant(s) in Hz; integration; assignment). Proton decoupled ¹³C NMR spectra were recorded on a Brucker DPX-400 (100 MHz) spectrometer and are reported in ppm from CD₃OD or CDCl₃ internal standard (49.0 or 77.0 ppm respectively). High Resolution Mass Spectra (HRMS) were obtained on a JEOL JMS HX 110A mass spectrometer. Mass measurement of DNA was made on a Voyager DE MALDI-TOF mass spectrometer (Applied Biosystems). The NHS esters of the fluorophores were purchased from Molecular Probes. All other chemicals were purchased from Sigma-Aldrich. 9°N polymerase (exo-) A485L/Y409V was generously provided by New England Biolabs.

Synthesis of chemically cleavable linker: 6-amino-hexen-1-ol (1)

To a mixture of (cis)-2-butene-1,4-diol (440 mg, 5.00 mmol) and (cyanomethyl)-trim ethylphosphonium iodide (1.24 g, 5.10 mmol) (Zaragoza, F. J. Org. Chem. 2002; 67(14), 4963-4964) were added propionitrile (4.0 mL) and N,N′-diisopropylethylamine (DIPEA) (1.10 mL, 6.32 mmol), and the mixture was stirred at 97° C. for 24 h. Water (0.20 mL, 11.1 mmol) was added, and stirring at 97° C. was continued for 15 h. Water (25 mL) and concentrated hydrochloric acid (1.0 mL, 12 mmol) were added, and the mixture was extracted with ethyl acetate (3×25 mL). The combined extracts were washed with brine, dried with Na₂SO₄, and concentrated. The residue was purified by flash chromatography to yield 410 mg (74%) 6-hydroxy-4-hexanenitrile as an oil. ¹H NMR (400 MHz, CDCl₃) δ 5.71 (m, 1H), 5.62 (m, 1H), 4.20 (d, 2H), 2.45 (t, 2H), 2.33 (q, 2H). ¹³C NMR (100 MHz, CDCl₃) δ 129.3, 129.1, 117.7, 60.0, 22.5, 18.3. HRMS m/z: calcd for C₆H₉NO (M+H⁺) 112.068, found 112.082.

To a suspension of LiAlH₄ (380 mg, 10.0 mmol) in THF (50 mL) was added slowly dropwise a solution of 6-hydroxy-4-hexanenitrile (333 mg, 3.00 mmol) in THF (20 mL) while keeping the temperature below 0° C. When the reaction slowed down, the mixture was heated to reflux for 24 h. The excess LiAlH₄ was quenched by addition of 15% sodium hydroxide. The resulting white precipitate was filtered. The filtrate was dried over Na₂SO₄ and concentrated under reduced pressure to yield 6-Amino-hexen-1-ol 1 (296 mg, 85%) as yellow oil. ¹H NMR (400 MHz, CDCl₃) δ 1.54 (q, 2H), 1.82 (brs, 3H, NH₂+OH), 5.71 (m, 1H), 5.62 (m, 1H), 4.20 (d, 2H), 4.07 (d, 2H), 2.70 (t, 2H), 2.09 (q, 2H). ¹³C NMR (100 MHz, CDCl₃) δ 132.3, 129.8, 63.6, 41.8, 32.2, 29.8. HRMS m/z calcd for C₆H₁₄NO (M+H⁺) 117.075, found 117.107.

Allylic-Bodipy-FL-510 (2). Allylic alcohol (1) (3 mg, 0.026 mmol) was dissolved in 550 μl of acetonitrile and 100 μl of 1 M NaHCO₃ aqueous solution. A solution of Bodipy-FL-510 N-hydroxysuccinimidyl (NHS) ester (5 mg, 0.013 mmol) in 400 μl of acetonitrile was added slowly to the above reaction mixture and then stirred for 5 h at room temperature. The resulting reaction mixture was purified on a preparative silica-gel TLC plate (CHCl₃/CH₃OH=95:5) to give pure allylic-Bodipy-FL-510 (2) (4.8 mg, 46%). ¹H NMR (400 MHz, CD₃OD) δ 7.50 (s, 1H), 6.63 (s, 1H), 6.33 (s, 1H), 5.68 (m, 1H), 5.62 (m, 1H), 5.45 (m, 2H), 4.21 (d, 2H), 3.23 (t, 2H), 2.24 (t, 2H), 2.22 (t, 2H), 2.16 (d, 1H), 2.05 (d, 1H), 2.00 (m, 2H), 1.57 (m, 2H). ¹³C NMR (100 MHz, CD₃OD) δ 175.4, 163.7, 146.9, 145.8, 132.1, 130.0, 129.2, 128.7, 123.2, 116.0, 109.2, 101.0, 59.4, 40.4, 34.0, 30.6, 30.3, 24.8, 18.7, 16.9. High-resolution MS (FAB⁺) m/z: anal. calcd for C₂₀H₂₆O₂N₃F₂B (M+H⁺), 390.2086; found, 390.2101.

Allylic-Bodipy-FL-510 NHS ester (3). N,N′-disuccinimidyl carbonate (4.27 mg, 0.017 mmol) and triethylamine (4.6 μl, 0.033 mmol) were added to a solution of allylic-Bodipy-FL-510 (2) (4.8 mg, 0.012 mmol) in 200 μl of dry acetonitrile. The reaction mixture was stirred under argon at room temperature for 6 h. Solvent was removed under vacuum, and 1 ml of 1 M NaHCO₃ aqueous solution was added to the residual mixture. Extracted with ethyl acetate three times, the combined organic layer was dried over Na₂SO₄, which was directly subjected to the following coupling reaction without further purification.

3′-O-allyl-dGTP-allyl-Bodipy-FL-510 (4). Crude allylic-Bodipy-FL-510 NHS ester (3) (6.3 mg) in 300 μl of acetonitrile was added to a solution of 3′-O-allyl-dGTP-NH₂ (2 mg, 0.004 mmol) in 300 μl of Na₂CO₃/NaHCO₃ buffer (0.1 M, pH 8.7). The reaction mixture was stirred at room temperature for 3 h, which was subsequently purified by preparative silica-gel TLC plate (CHCl₃/CH₃OH, 1/1) to remove unreacted allylic-Bodipy-FL-510 NHS ester (3). The crude product was concentrated further under vacuum and purified with reverse-phase HPLC on a 150×4.6-mm C18 column to obtain the pure product 3′-O-allyl-dGTP-allyl-Bodipy-FL-510 (4) (retention time of 35 min). Mobile phase: A, 8.6 mM triethylamine/100 mM hexafluoroisopropyl alcohol in water (pH 8.1); B, methanol. Elution was performed with 100% A isocratic over 10 min, followed by a linear gradient of 0-50% B for 20 min and then 50% B isocratic over another 20 min. 4 was characterized by the following primer extension reaction and MALDI-TOF MS.

Primer extension using 3′-O-allyl-dGTP-allyl-Bodipy-FL-510 (4). The polymerase extension reaction mixture consisted of 60 μmol of primer (5′-GTTGAT-GTACACATTGTCAA-3′) (SEQ ID NO:1), 80 pmol of 100-mer template (5′-TACCCGGAGGC-CAAGTACGGCGGGTACGTCCTTGACAATGTGTACATCAACATCACCTACCACCATGT CAGTCTCGGTTGGATCCTCTATTGTGTCCGGG-3′) (SEQ ID NO:2), 120 pmol of 3′-O-allyl-dGTP-allyl-Bodipy-FL-510 (4), 1× Thermopol reaction buffer (20 mM Tris-HCl/10 mM (NH₄)₂SO₄/10 mM KCl/2 mM MgSO₄/0.1% Triton X-100, pH 8.8, New England Biolabs), and 6 units of 9°N Polymerase (exo-)A485L/Y409V in a total volume of 20 μl. The reaction consisted of 20 cycles at 94° C. for 20 sec, 46° C. for 40 sec, and 60° C. for 90 sec. After the reaction, a small portion of the DNA extension product was desalted by using ZipTip and analyzed by MALDI-TOF MS, which shows a dominant peak at m/z 6,967 corresponding to the DNA product (5). The rest of the product 5 was subjected to the following deallylation.

One-pot dual-deallylation of DNA extension product (5) to produce DNA product (6). DNA product 5 (20 pmol) was added to a mixture of degassed 1× Thermopol reaction buffer (20 mM Tris-HCl/10 mM (NH₄)₂SO₄/10 mM KCl/2 mM MgSO₄/0.1% Triton X-100, pH 8.8, 1 μl), Na₂PdCl₄ in degassed H₂O (7 μl, 23 nmol) and P(PhSO₃Na)₃ in degassed H₂O (10 μl, 176 nmol) to perform a one-pot dual-deallylation reaction. The reaction mixture was then placed in a heating block and incubated at 70° C. for 30 seconds to yield quantitatively deallylated DNA product (6) and analyzed by MALDI-TOF MS to yield a single peak at m/z 6,512.

3′-O-allyl-dUTP-allyl-R6G (7): Crude Allylic-R6G NHS ester (prepared by the same procedure as Allylic-Bodipy-FL-510 NHS ester) (7 mg) in 300 μl of DMF was added to a solution of 3′-O-allyl-dUTP-NH₂ (2 mg, 4 mmol) in 300 μl of Na₂CO₃—NaHCO₃ buffer (0.1 M, pH 8.7). The reaction mixture was stirred at room temperature for 3 h. A preparative silica-gel TLC plate was used to purify the crude product (CHCl₃/CH₃OH, 1/1). The resulting product was concentrated under vacuum and further purified with reverse-phase HPLC on a 150×4.6-mm C18 column to obtain the pure product 10 (retention time of 38 min). Mobile phase: A, 8.6 mM triethylamine/100 mM hexafluoroisopropyl alcohol in water (pH 8.1); B, methanol. Elution was performed with 100% A isocratic over 10 min, followed by a linear gradient of 0-50% B for 20 min and then 50% B isocratic over another 20 min. 3′-O-allyl-dUTP-allyl-R6G was characterized by single-base extension reaction and MALDI-TOF MS similarly as for 4.

3′-O-allyl-dATP-allyl-ROX (8): Crude Allylic-ROX NHS ester (prepared by the same procedure as Allylic-Bodipy-FL-510 NHS ester) (7 mg) in 300 μl of DMF was added to a solution of 3′-O-allyl-dATP-NH₂ (2 mg, 4 mmol) in 300 μl of Na₂CO₃—NaHCO₃ buffer (0.1 M, pH 8.7). The reaction mixture was stirred at room temperature for 3 h. A preparative silica-gel TLC plate was used to purify the crude product (CHCl₃/CH₃OH, 1/1). The resulting product was concentrated under vacuum and further purified with reverse-phase HPLC on a 150×4.6-mm C18 column to obtain the pure product 10 (retention time of 40 min). Mobile phase: A, 8.6 mM triethylamine/100 mM hexafluoroisopropyl alcohol in water (pH 8.1); B, methanol. Elution was performed with 100% A isocratic over 10 min, followed by a linear gradient of 0-50% B for 20 min and then 50% β isocratic over another 20 min. 3′-O-allyl-dATP-allyl-ROX was characterized by single-base extension reaction and MALDI-TOF MS similarly as for 4.

3′-O-allyl-dCTP-allyl-Bodipy-650 (9): Crude Allylic-Bodipy-650 NHS ester (prepared by the same procedure as Allylic-Bodipy-FL-510 NHS ester) (7 mg) in 300 μl of DMF was added to a solution of 3′-O-allyl-dCTP-NH₂ (2 mg, 4 μmol) in 300 μl of Na₂CO₃—NaHCO₃ buffer (0.1 M, pH 8.7). The reaction mixture was stirred at room temperature for 3 h. A preparative silica-gel TLC plate was used to purify the crude product (CHCl₃/CH₃OH, 1/1). The resulting product was concentrated under vacuum and further purified with reverse-phase HPLC on a 150×4.6-mm C18 column to obtain the pure product 10 (retention time of 35 min). Mobile phase: A, 8.6 mM triethylamine/100 mM hexafluoroisopropyl alcohol in water (pH 8.1); B, methanol. Elution was performed with 100% A isocratic over 10 min, followed by a linear gradient of 0-50% B for 20 min and then 50% β isocratic over another 20 min. 3′-O-allyl-dCTP-allyl-Bodipy-650 was characterized by single-base extension reaction and MALDI-TOF MS similarly as for 4.

Synthesis of chemically cleavable linker 4-amino-2-methylene-1-butanol (14):

2-triphenylmethoxymethyl-2-propen-1-ol (10): To a solution of trityl chloride (4.05 g; 14.3 mmol) and 2-methylenepropane-1,3-diol (1.20 mL; 14.3 mmol) in dry CH₂Cl₂ (20 mL) was added triethylamine (4.0 mL; 28.5 mmol) slowly at room temperature. The reaction was stirred at room temperature for 1 h and then ethyl acetate (100 mL) and saturated aqueous NaHCO₃ (30 mL) were added. The organic layer was separated and washed by saturated aqueous NaHCO₃ and NaCl respectively, and dried over anhydrous Na₂SO₄. After evaporation of the solvent, the residue was purified by flash column chromatography over silica gel using ethyl acetate-hexane (1:10˜5) as the eluent to afford 10 as white solid (2.89 g; 62% yield): ¹H NMR (400 MHz, CDCl₃) δ 7.42-7.48 (m, 6H, six of ArH), 7.27-7.33 (m, 6H, six of ArH), 7.20-7.27 (m, 3H, three of ArH), 5.26 (s, 1H, one of C═CH₂), 5.17 (s, 1H, one of C═CH₂), 4.13 (d, J=6.1 Hz, 2H, CH₂OH), 3.70 (s, 2H, Ph₃COCH₂), 1.72 (t, J=6.1 Hz, 1H, CH₂OH); ¹³C NMR (100 MHz, CDCl₃) δ 145.4, 143.6, 128.3, 127.6, 126.8, 111.6, 87.0, 65.3, 64.5, (8).

1-bromo-2-triphenylmethoxymethyl-2-propene (11): To a solution of 10 (2.56 g; 7.74 mmol) in CH₂Cl₂ (75 ml), CBr₄ (3.63 g; 10.83 mmol) and triphenylphosphine (2.47 g; 9.31 mmol) were added respectively at 0° C. and the reaction was stirred at room temperature for 40 min. Cooled to 0° C., ethyl acetate (100 mL) and saturated aqueous NaHCO₃ (30 mL) were added. The organic layer was separated and washed by saturated aqueous NaCl, and dried over anhydrous Na₂SO₄. After evaporation of the solvent, the residue was purified by flash column chromatography over silica gel using CH₂Cl₂-hexane (1:5) as the eluent to afford 11 as white solid (3.02 g; 92% yield): ¹H NMR (400 MHz, CDCl₃) δ 7.42-7.48 (m, 6H, six of ArH), 7.27-7.33 (m, 6H, six of ArH), 7.20-7.27 (m, 3H, three of ArH), 5.37 (s, 1H, one of C═CH₂), 5.31 (s, 1H, one of C═CH₂), 4.01 (s, 2H, CH₂Br), 3.75 (s, 2H, Ph₃COCH₂); ¹³C NMR (100 MHz, CDCl₃) δ 143.6, 142.6, 128.4, 127.6, 126.9, 115.8, 86.9, 64.2, 33.5, (9).

3-triphenylmethoxymethyl-3-butanenitrile (12): To a solution of 11 (1.45 g; 3.69 mmol) and in dry CH₃CN (37 mL) was added trimethylsilyl cyanide (0.49 mL; 3.69 mmol). Then 1 M tetrabutylammonium fluoride (TBAF) in THF solution (3.69 mL, 3.69 mmol) was added slowly at room temperature and the reaction was stirred for 20 min. Most solvents were evaporated and ethyl acetate (100 mL) and saturated aqueous NaHCO₃ (30 mL) were added. The organic layer was washed by saturated aqueous NaCl and dried over anhydrous Na₂SO₄. After evaporation of the solvent, the residue was purified by flash column chromatography over silica gel using acetate-hexane (1:10) as the eluent to afford 12 as white solid (1.01 g; 64% yield): ¹H NMR (400 MHz, CDCl₃) δ 7.39-7.45 (m, 6H, six of ArH), 7.21-7.34 (m, 9H, nine of ArH), 5.31 (s, 2H, C═CH₂), 3.64 (s, 2H, Ph₃COCH₂), 3.11 (s, 2H, CH₂CN); ¹³C NMR (100 MHz, CDCl₃) δ 143.3, 135.5, 128.2, 127.7, 126.9, 116.8, 114.7, 87.0, 65.7, 21.9, (10).

3-triphenylmethoxymethyl-3-buten-1-amine (13): To a solution of LiAlH₄ (119 mg; 2.98 mmol) in dry ether (5 mL), AlCl₃ (400 mg; 2.94 mmol) was added carefully at 0° C. and the mixture was stirred for 15 min. Then a solution of 12 (829 mg; 2.44 mmol) in dry ether (9 mL) was added and the reaction was stirred at 0° C. for 3 h. After that 10% aqueous NaOH (10 mL) was added to quench the reaction. The organic layer was separated and washed by saturated aqueous NaHCO₃ and NaCl respectively, and dried over anhydrous K₂CO₃. After evaporation of the solvent, the residue was purified by flash column chromatography over silica gel using CH₃OH—CH₂Cl₂ (1:20˜5) as the eluent to afford 10 as colorless oil (545 mg; 65% yield): ¹H NMR (400 MHz, CDCl₃) δ 7.41-7.48 (m, 6H, six of ArH), 7.26-7.33 (m, 6H, six of ArH), 7.19-7.26 (m, 3H, three of ArH), 5.33 (s, 1H, one of C═CH₂), 4.96 (s, 1H, one of C═CH₂), 3.53 (s, 2H, Ph₃COCH₂), 2.70 (m, 2H, CH₂CH₂NH₂), 2.18 (t, J=6.7 Hz, 2H, CH₂CH₂NH₂), 2.06 (br s, 2H, NH₂); ¹³C NMR (100 MHz, CDCl₃) δ 143.6, 143.4, 128.1, 127.4, 126.5, 111.3, 86.5, 66.1, 39.8, 37.4; HRMS (FAB+) calcd for C₂₄H₂₆ON (M+H⁺): 344.2014, found: 344.2017, (11).

4-amino-2-methylene-1-butanol (14): To a solution of 13 (540 mg; 1.57 mmol) in CH₃OH (11 mL) was added HCl (2M solution in ether; 5.5 mL) at room temperature and the reaction was stirred for 1 h. Then 7 M methanolic ammonia solution (2.7 mL) was added at room temperature and the reaction was stirred for 10 min. The white solid formed was removed by filtration and washed by CH₃OH. The combined solution was evaporated the residue was purified by flash column chromatography over silica gel using CH₃OH—CH₂Cl₂ (1:4) as the eluent to afford 14 as colorless oil (151 mg; 95% yield): ¹H NMR (400 MHz, CD₃OD) δ 5.19 (s, 1H, one of C═CH₂), 5.01 (s, 1H, one of C═CH₂), 4.06 (s, 2H, CH₂OH), 3.10 (t, J=7.5 Hz, 2H, CH₂CH₂NH₂), 2.46 (t, J=7.5 Hz, 2H, CH₂CH₂NH₂); ¹³C NMR (100 MHz, CD₃OD) δ 145.3, 113.7, 65.5, 39.5, 32.0.

The synthetic procedures of iso-allylic-fluorophore, iso-allylic-fluorophore NHS ester and 3′-O-allyl-dNTP-iso-allyl-fluorophore are the same as described previously for allylic-fluorophore, allylic-fluorophore NHS ester, and 3′-O-allyl-dNTP-allyl-fluorophore, respectively.

iso-Allyl-Bodipy-FL-510 (15):

¹H NMR (400 MHz, CD₃OD) δ 7.42 (s, 1H), 7.00 (d, J=4.0 Hz, 1H), 6.32 (d, J=4.0 Hz, 1H), 6.21 (s, 1H), 5.06 (s, 1H, one of C═CH₂), 4.87 (s, 1H, one of C═CH₂), 4.01 (s, 2H, CH₂OH), 3.33 (t, J=7.5 Hz, 2H), 3.21 (t, J=7.7 Hz, 2H), 2.59 (t, J=7.7 Hz, 2H), 2.51 (s, 3H, one of ArCH₃), 2.28 (s, 3H), 2.26 (t, J=7.1 Hz, 2H)

iso-Allyl-Bodipy-650 (16):

¹H NMR (400 MHz, CD₃OD) δ 8.12 (dd, J=0.9, 3.8 Hz, 1H), 7.63 (m, 3H), 7.54 (d, J=6.4 Hz, 2H), 7.35 (s, 1H), 7.18-7.22 (m, 2H), 7.12 (m, 2H), 7.06 (d, J=8.8 Hz, 2H), 6.85 (d, J=4.2 Hz, 1H), 5.06 (s, 1H, one of C═CH₂), 4.86 (s, 1H, one of C═CH₂), 4.56 (s, 2H), 4.00 (s, 2H, CH₂OH), 3.28 (m, 4H), 2.23 (t, J=7.1 Hz, 2H), 2.14 (t, J=7.5 Hz, 2H), 1.49-1.62 (m, 4H), 1.25-1.34 (m, 2H).

iso-Allyl-R6G (17):

¹H NMR (400 MHz, CD₃OD) δ 8.12 (d, J=8.1 Hz, 1H), 8.05 (dd, J=1.8, 8.1 Hz, 1H), 7.66 (d, J=1.6 Hz, 1H), 7.02 (s, 2H), 6.88 (s, 2H), 5.08 (s, 1H, one of C═CH₂), 4.92 (s, 1H, one of C═CH₂), 4.06 (s, 2H, CH₂OH), 3.48-3.56 (m, 6H), 2.40 (t, J=7.2 Hz, 2H), 2.13 (s, 6H), 1.36 (t, J=7.2 Hz, 6H).

iso-Allyl-ROX (18):

¹H NMR (400 MHz, CD₃OD) δ 8.03 (d, J=8.1 Hz, 1H), 7.98 (dd, J=1.6, 8.1 Hz, 1H), 7.60 (d, J=1.4 Hz, 1H), 6.75 (s, 2H), 5.08 (s, 1H, one of C═CH₂), 4.91 (s, 1H, one of C═CH₂), 4.05 (s, 2H, CH₂OH), 3.45-3.57 (m, 10H), 3.03-3.10 (m, 4H), 2.64-2.73 (m, 4H), 2.38 (t, J=7.1 Hz, 2H), 2.04-2.15 (m, 4H), 1.89-1.99 (m, 4H).

REFERENCES

-   1. Collins, F. S.; Green, E. D.; Guttmacher, A. E.; Guyer, M. S. “A     vision for the future of genomics research”. Nature 2003, 422,     835-847. -   2. Ju, J.; Li, Z.; Edwards, J.; Itagaki, Y. “Massive parallel method     for decoding DNA and RNA”. U.S. patent 2003, U.S. Pat. No.     6,664,079. -   3. Meng, Q.; Kim, D. H.; Bai, X.; Bi, L.; Turro, N. J.; Ju, J.     “Design and synthesis of a photocleavable fluorescent nucleotide     3′-O-allyl-dGTP-PC-Bodipy-FL-510 as a reversible terminator for DNA     sequencing by synthesis”. JACS, submitted -   4. Mitra, R. D.; Shendure, J.; Olejnik, J.; Olejnik, E. K.;     Church, G. M. “Fluorescent in situ sequencing on polymerase     colonies”. Anal. Biochem. 2003, 320, 55-65. -   5. Pleasants, J. C.; Guo, W.; Rabenstein, D. L. “A comparative study     of the kinetics of selenol/diselenide and thiol/disulfide exchange     reactions”. J. Am. Chem. Soc. 1989, 111, 6553-6558. -   6. Huyghues-Despointes, B. M. P.; Nelson, J. W. “Stabilities of     disulfide bond intermediates in the folding of apamin”. Biochemistry     1992, 31, 1476-1483. -   7. Ruparel, H.; Li, Z.; Bai, X.; Kim, D. H.; Turro, N. J.; Ju, J.     “Design and synthesis of a 3′-O-allyl photocleavable fluorescent     nucleotide as a reversible terminator for DNA sequencing by     synthesis”. Proc. Natl. Acad. Sci. USA 2005, 102, 5932-5937. -   8. Kang, J.-H.; Siddiqui, M. A.; Sigano, D. M.; Krajewski, K.;     Lewin, N. E.; Pu, Y.; Blumberg, P. M.; Lee, J. and Marquez, V. E.     Conformationally constrained analogues of diacylglycerol. 24.     Asymmetric synthesis of a chiral (R)-DAG-Lactone template as a     versatile precursor for highly functionalized DAG-Lactones. Org.     Lett. 2004, 6(14), 2413-2416 -   9. Honma, M.; Sawada, T.; Fujisawa, Y.; Utsugi, M.; Watanabe, H.;     Umino, A.; Matsumura, T.; Hagihara, T.; Takano, M. and Nakada, M.     Asymmetric catalysis on the intramolecular cyclopropanation of     α-diazo-β-keto Sulfones J. Am. Chem. Soc. 2003, 125(10), 2860-2861. -   10. Soli, E. D.; Manoso, A. S.; Patterson, M. C. and DeShong, P.     Azide and cyanide displacements via hypervalent silicate     intermediates J. Org. Chem. 1999, 64(9), 3171-7. -   11. Jacobson, M. A. and Williard, P. G. Generation of     1-azapentadienyl anion from     N-(tert-butyldimethylsilyl)-3-buten-1-amine J. Org. Chem. 2002,     67(11), 3915-3918. 

1. A nucleotide analogue comprising (i) a base selected from the group consisting of adenine or an analogue of adenine, guanine or an analogue of guanine, cytosine or an analogue of cytosine, thymine or an analogue of thymine and uracil or an analogue of uracil, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′-oxygen of the deoxyribose and (iv) a fluorophore bound to the base via an allyl linker.
 2. The nucleotide analogue of claim 1, wherein the nucleotide analogue is an analogue of DATP, dGTP, dCTP or dUTP.
 3. The nucleotide analogue of claim 1, wherein the fluorophore is selected from the group consisting of ROX, Bodipy-FL-510, Bodipy-650 and R6G.
 4. The nucleotide analogue of claim 1, wherein the fluorophore is bound to the base via an iso-allyl linker.
 5. The nucleotide analogue of claim 4, wherein the nucleotide analogue is selected from the group consisting of 3′-O-allyl-dGTP-iso-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-iso-allyl-Bodipy-650, 3′-O-allyl-dATP-iso-allyl-ROX and 3′-O-allyl-dUTP-iso-allyl-R6G.
 6. The nucleotide analogue of claim 1, wherein the nucleotide analogue is selected from the group consisting of 3′-O-allyl-dGTP-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-allyl-Bodipy-650, 3′-O-allyl-dATP-allyl-ROX and 3′-O-allyl-dUTP-allyl-R6G.
 7. A method for making a nucleotide analogue wherein the nucleotide analogue comprises (i) a base selected from the group consisting of adenine or an analogue of adenine, guanine or an analogue of guanine, cytosine or an analogue of cytosine and uracil or an analogue of uracil, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′ oxygen of the deoxyribose, and (iv) a fluorophore bound to the base via an allyl linker which is not an iso-allyl linker, comprising the steps of: (a) contacting 6-amino-hex-2-en-1-ol and an N-hydroxysuccinimide ester of a fluorophore in the presence of a first suitable solvent and a suitable base; (b) treating the resulting product of step (a) with DSC/Et₃N in a second suitable solvent; and (c) treating the resulting product of step (b) with a 3′-O-allyl-dNTP-NH₂ in the presence of a suitable buffered solvent, wherein the base of the 3′-O-allyl-dNTP-NH₂ is an adenine, guanine, cytosine, uracil, or an analogue thereof, thereby making the nucleotide analogue.
 8. The method of claim 7, wherein the steps comprise those set forth in FIG. 2, scheme A; FIG. 2, scheme B; FIG. 2, scheme C; or FIG. 2, scheme D.
 9. The method of claim 7, wherein the first suitable solvent is DMF and the second suitable solvent is DMF.
 10. The method of claim 7, wherein the first suitable solvent is acetonitrile and the second suitable solvent is DMF.
 11. The method of claim 7, wherein the suitable base is NaHCO₃.
 12. The method of claim 7, wherein the suitable buffered solvent is DMF buffered with NaHCO₃—Na₂CO₃.
 13. The method of claim 7, wherein the suitable buffered solvent is acetonitrile buffered with NaHCO₃—Na₂CO₃.
 14. A method for making a nucleotide analogue wherein the nucleotide analogue comprises (i) a base selected from the group consisting of adenine or an analogue of adenine, guanine or an analogue of guanine, cytosine or an analogue of cytosine and uracil or an analogue of uracil, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′ oxygen of the deoxyribose, and (iv) a fluorophore bound to the base via an allyl linker, comprising the steps of: (a) contacting 2-(2-amino-ethyl)-prop-2-en-1-ol and an N-hydroxysuccinimide ester of a fluorophore in the presence of a first suitable solvent and a suitable base; (b) treating the resulting product of step (a) with DSC/Et₃N in a second suitable solvent; and (c) treating the resulting product of step (b) with a 3′-O-allyl-dNTP-NH₂ in the presence of a suitable buffered solvent, wherein the base of the 3′-O-allyl-dNTP-NH₂ is an adenine, guanine, cytosine, uracil, or an analogue thereof, thereby making the nucleotide analogue.
 15. The method of claim 14, wherein the steps comprise those set forth in FIG. 6, scheme A; FIG. 6, scheme B; FIG. 6, scheme C; or FIG. 6, scheme D.
 16. The method of claim 14, wherein the first suitable solvent is DMF and the second suitable solvent is DMF.
 17. The method of claim 14, wherein the suitable base is NaHCO₃.
 18. The method of claim 14, wherein the suitable buffered solvent is DMF buffered with NaHCO₃—Na₂CO₃.
 19. A method for determining the sequence of a DNA comprising performing the following steps for each residue of the DNA to be sequenced: (a) contacting the DNA with a DNA polymerase in the presence of (i) a primer and (ii) four fluorescent nucleotide analogues under conditions permitting the DNA polymerase to catalyze DNA synthesis, wherein (1) the nucleotide analogues consist of an analogue of dGTP, an analogue of dCTP, an analogue of dTTP or dUTP, and an analogue of DATP, (2) each nucleotide analogue comprises (i) a base selected from the group consisting of adenine, guanine, cytosine, thymine or uracil, and analogues thereof, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′-oxygen of the deoxyribose and (iv) a fluorophore bound to the base via an allyl linker, so that a nucleotide analogue complementary to the residue being sequenced is bound to the DNA by the DNA polymerase, and (3) each of the four analogues has a predetermined fluorescence wavelength which is different than the fluorescence wavelengths of the other three analogues; (b) removing unbound nucleotide analogues; (c) determining the identity of the bound nucleotide analogue; and (d) following step (c), except with respect to the final DNA residue to be sequenced, chemically cleaving from the bound nucleotide analogue the fluorophore and the allyl moiety bound to the 3′-oxygen atom of the deoxyribose, thereby determining the sequence of the DNA.
 20. The method of claim 19, wherein chemically cleaving the fluorophore and the allyl moiety bound to the 3′-oxygen atom is performed using Na₂PdCl₄.
 21. The method of claim 19, wherein the primer is a self-priming moiety.
 22. The method of claim 19, wherein the DNA is bound to a solid substrate.
 23. The method of claim 22, wherein the DNA is bound to the solid substrate via 1,3-dipolar azide-alkyne cycloaddition chemistry.
 24. The method of claim 22, wherein about 1000 or fewer copies of the DNA are bound to the solid substrate.
 25. The method of claim 19, wherein the four fluorescent nucleotide analogues are 3′-O-allyl-dGTP-iso-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-iso-allyl-Bodipy-650, 3′-O-allyl-dATP-iso-allyl-ROX and 3′-O-allyl-dUTP-iso-allyl-R6G.
 26. The method of claim 19, wherein the four fluorescent nucleotide analogues are 3′-O-allyl-dGTP-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-allyl-Bodipy-650, 3′-O-allyl-dATP-allyl-ROX and 3′-O-allyl-dUTP-allyl-R6G.
 27. The method of claim 19, wherein the DNA polymerase is a 9°N polymerase.
 28. A kit for performing the method claim 19, comprising, in separate compartments, (a) a nucleotide analogue of (i) GTP, (ii) ATP, (iii) CTP and (iv) TTP or UTP, wherein each analogue comprises (i) a base selected from the group consisting of adenine, guanine, cytosine, thymine or uracil, or an analogue thereof, (ii) a deoxyribose, (iii) an allyl moiety bound to the 3′-oxygen of the deoxyribose and (iv) a fluorophore bound to the base via an allyl linker, (b) reagents suitable for use in DNA polymerization; and (c) instructions for use.
 29. The kit of claim 28, wherein the kit comprises 3′-O-allyl-dGTP-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-allyl-Bodipy-650, 3′-O-allyl-dATP-allyl-ROX and 3′-O-allyl-dUTP-allyl-R6G.
 30. The kit of claim 28, wherein the kit comprises 3′-O-allyl-dGTP-iso-allyl-Bodipy-FL-510, 3′-O-allyl-dCTP-iso-allyl-Bodipy-650, 3′-O-allyl-dATP-iso-allyl-ROX and 3′-O-allyl-dUTP-iso-allyl-R6G.
 31. A method for covalently affixing a detectable moiety, via an allyl linker, to an NH₂-bearing molecule, comprising contacting the detectable moiety with the NH₂-bearing molecule in the presence of a suitable solvent and suitable base, wherein the detectable moiety comprises a mass tag, fluorophore or chromophore bound to a NHS ester of an allyl moiety.
 32. The method of claim 31, wherein the NH₂-bearing molecule is a nucleotide and the detectable moiety comprises a fluorophore. 